A robust algorithm for separation of Chinese characters from line drawings
Internal identifier: 002710 (Main/Exploration); previous: 002709; next: 002711
Authors: Liang-Hua Chen [People's Republic of China]; Jiing-Yuh Wang [People's Republic of China]; Hong-Yuan Liao [People's Republic of China]; Kuo-Chin Fan [People's Republic of China]
Source:
- Image and Vision Computing [0262-8856]; 1996.
Abstract
Separating characters from graphics is an important step towards automatic document understanding. In this paper, we propose a robust algorithm to separate Chinese characters from graphics. Our approach is based on clustering the feature points in an image. Two remedy procedures are also proposed to solve the problems caused by the thinning process. This will obtain a better localization of feature points and improve the performance of the separation process. Using our algorithm, all Chinese characters can be separated from graphics without regard to the font style or orientation of the character. Furthermore, our algorithm can also handle the serious case where characters touch/cross lines. The proposed algorithm has been successfully tested on several kinds of line drawings, such as land register maps and form documents.
DOI: 10.1016/0262-8856(96)01081-5
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000915
- to stream Istex, to step Curation: 000905
- to stream Istex, to step Checkpoint: 001B21
- to stream Main, to step Merge: 002854
- to stream Main, to step Curation: 002710
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>A robust algorithm for separation of Chinese characters from line drawings</title>
<author><name sortKey="Chen, Liang Hua" sort="Chen, Liang Hua" uniqKey="Chen L" first="Liang-Hua" last="Chen">Liang-Hua Chen</name>
</author>
<author><name sortKey="Wang, Jiing Yuh" sort="Wang, Jiing Yuh" uniqKey="Wang J" first="Jiing-Yuh" last="Wang">Jiing-Yuh Wang</name>
</author>
<author><name sortKey="Liao, Hong Yuan" sort="Liao, Hong Yuan" uniqKey="Liao H" first="Hong-Yuan" last="Liao">Hong-Yuan Liao</name>
</author>
<author><name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:B17A96634211979F6A91F70091C630F9EB4280AE</idno>
<date when="1996" year="1996">1996</date>
<idno type="doi">10.1016/0262-8856(96)01081-5</idno>
<idno type="url">https://api.istex.fr/document/B17A96634211979F6A91F70091C630F9EB4280AE/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000915</idno>
<idno type="wicri:Area/Istex/Curation">000905</idno>
<idno type="wicri:Area/Istex/Checkpoint">001B21</idno>
<idno type="wicri:doubleKey">0262-8856:1996:Chen L:a:robust:algorithm</idno>
<idno type="wicri:Area/Main/Merge">002854</idno>
<idno type="wicri:Area/Main/Curation">002710</idno>
<idno type="wicri:Area/Main/Exploration">002710</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">A robust algorithm for separation of Chinese characters from line drawings</title>
<author><name sortKey="Chen, Liang Hua" sort="Chen, Liang Hua" uniqKey="Chen L" first="Liang-Hua" last="Chen">Liang-Hua Chen</name>
<affiliation wicri:level="1"><country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Department of Computer Science and Information Engineering, Fu Jen University, HsinChuang, Taipei, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Wang, Jiing Yuh" sort="Wang, Jiing Yuh" uniqKey="Wang J" first="Jiing-Yuh" last="Wang">Jiing-Yuh Wang</name>
<affiliation wicri:level="1"><country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Institute of Information and Electrical Engineering, National Central University, Chung-Li, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Liao, Hong Yuan" sort="Liao, Hong Yuan" uniqKey="Liao H" first="Hong-Yuan" last="Liao">Hong-Yuan Liao</name>
<affiliation wicri:level="1"><country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Institute of Information Science, Academia Sinica, Taipei, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<affiliation wicri:level="1"><country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Institute of Information and Electrical Engineering, National Central University, Chung-Li, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Image and Vision Computing</title>
<title level="j" type="abbrev">IMAVIS</title>
<idno type="ISSN">0262-8856</idno>
<imprint><publisher>ELSEVIER</publisher>
<date type="published" when="1995">1995</date>
<biblScope unit="volume">14</biblScope>
<biblScope unit="issue">10</biblScope>
<biblScope unit="page" from="753">753</biblScope>
<biblScope unit="page" to="761">761</biblScope>
</imprint>
<idno type="ISSN">0262-8856</idno>
</series>
<idno type="istex">B17A96634211979F6A91F70091C630F9EB4280AE</idno>
<idno type="DOI">10.1016/0262-8856(96)01081-5</idno>
<idno type="PII">0262-8856(96)01081-5</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0262-8856</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Separating characters from graphics is an important step towards automatic document understanding. In this paper, we propose a robust algorithm to separate Chinese characters from graphics. Our approach is based on clustering the feature points in an image. Two remedy procedures are also proposed to solve the problems caused by the thinning process. This will obtain a better localization of feature points and improve the performance of the separation process. Using our algorithm, all Chinese characters can be separated from graphics without regard to the font style or orientation of the character. Furthermore, our algorithm can also handle the serious case where characters touch/cross lines. The proposed algorithm has been successfully tested on several kinds of line drawings, such as land register maps and form documents.</div>
</front>
</TEI>
<affiliations><list><country><li>République populaire de Chine</li>
</country>
</list>
<tree><country name="République populaire de Chine"><noRegion><name sortKey="Chen, Liang Hua" sort="Chen, Liang Hua" uniqKey="Chen L" first="Liang-Hua" last="Chen">Liang-Hua Chen</name>
</noRegion>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<name sortKey="Liao, Hong Yuan" sort="Liao, Hong Yuan" uniqKey="Liao H" first="Hong-Yuan" last="Liao">Hong-Yuan Liao</name>
<name sortKey="Wang, Jiing Yuh" sort="Wang, Jiing Yuh" uniqKey="Wang J" first="Jiing-Yuh" last="Wang">Jiing-Yuh Wang</name>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002710 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002710 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:B17A96634211979F6A91F70091C630F9EB4280AE |texte= A robust algorithm for separation of Chinese characters from line drawings }}
This area was generated with Dilib version V0.6.32.